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A  GEOMETRIC  INTERPRETATION  OF  lACg^AWGE  MULTIPLIERS* 

R.  C.  Kao"^* 

The  RAND  Corporation,  Santa  Monica,  California 


A  fundamental  assumption  ccnmon  to  all  econoalc  analyses  is  the 
maximization  or  minimization  of  an  objective  function  (representing, 
say,  utility,  cost,  welfare  or  the  like)  subject  to  certain  con¬ 
straints.  Statements  of  the  type:  ”A  consumer  with  given  income 
maximizes  his  total  utility  only  if  his  marginal  utilities  for  the 
various  ccomodlties  are  proportional  to  their  prices,”  are  almost 
commonplace  in  economic  texts  and  are  generally  described  as  "equilib¬ 
rium  conditions"  of  the  economic  process  under  consideration.  Never¬ 
theless,  when  these  meaningful  economic  theorems  are  presented  to 
even  the  more  advanced  students,  the  argument  is  usually  shrouded 
with  a  complete  or  partial  m^-’stery  around  the  so-called  Lagrange 
multipliers.  Very  little  explanation  is  given  to  these  multipliers 
themselves  except  that  they  are  the  coefficients  used  to  form  a  cer¬ 
tain  Lagrangian  function,  the  extremization  of  which  leads  to  the 

I  am  indebted  to  Professor  A  .  A  .  Alchlan  for  calling  my  atten¬ 
tion  to  this  problem. 
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tieslred  equilibrium  conditions.  This  paper  Is  devotedl  to  a  pedagogic 
clarification  of  the  Intrinsic  oeaning  of  these  oultlpllerc  themselves 
aiid  a  natural  reformulation  of  the  equilibrium  conditions  vhlch  per¬ 
mits  a  better  insight  into  the  nature  of  constrained  extremum  probleLis 
In  econmlca. 

I^t 


(1) 


/  «  f(x) 


be  a  real-valued  function  of  a  single  variable  x.  The  function  f  may 
represent  the  short-run  cost  curve  of  a  production  process  vith  only 
one  variable  factor.  If  f  Is  sufficiently  smooth  (l.e.  x  is  infinites¬ 
imally  divisible)  a  necessary  condition  for  a  (relative)  minimi of  (l) 
is,  as  is  veil  knowni 


(2)  ^  =  0  . 


and  a  sufficient  condition  for  a  (relative)  mlnlmimi  of  (l)  is  (2)  plus 

(3)  ^  ~  fHx)  >  0 

.  _  dx  dx 

# 

Cf.  inter  alia  the  following  veil  known  econcmic  texts:  R.  G.  D. 
Allen,  Mathematical  Analysis  for  Econcmists,  London:  MacMillan,  19^9, 
p-pi  366--^36f;  idem,"  Mat  hectical  'Econcriics  ^"London:  MacMillan,  195^^ 
pp.  610,^  6lk;  D.  W.  Bushaw  and  R*  W.  Clover,  Introduction  to  Mathematical 
Econcmics,  Homewood,  Illinois*  Irvin,  1957 j  P-  331;  J-  M.  Henderson  and 
E.  Quandt,  Microeconcmlc  Theory,  A  Mathematical  Approach,  New  York: 
McQ^aw-Hill,  19 5^,  pp'."^ 27 3 -27 4*"  J,  R.  Hicks,  Value  and  Capital,  2nd  ed. , 
London;  Oxford,  19^,  P-  303$  P-  A.  Samuelson,  Foundations  of  Economic 
Analysis^  Cambridge,  Massachusetts:  Harvard,  pp.  3^2- 3^5;  ancL  Taro 
Yamane,  Mathematics  for  Econcmists,  Englewood,  New  Jersey:  Prentice-Hall, 

1962,  pp7  116-123- 
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5  • 

Geometrical!;/,  (2)  states  that  the  taij^nt  vector  to  the  cur^/e  C 
defined  by  (1)  must  be  horizontal;  and  (3)  states  that  it  is  in¬ 
creasing  in  slope  around  aay  root  of  (2).  (Figure  1) 


y 


Figure  1 


A  more  easily  generalizable  geometric  interpretation  of  (3)  is  the 
following:  Any  function  f  possessing  sufficient  number  of  derivatives 
(l.e.,  sufficiently  smooth)  may  be  expanded  into  a  Taylor's  series: 


f(x)  =  fCx^) 


5“  V  X  — 
dx 


X  ) 


iQ^f, 

*  5—^*  - 

ax 


X  ) 


n: 


d“f 


:<  -  X 


ax 


wheire  all  derivatives  are  to  be  evaluated  at  That  is-»  the  value 

of  f  at  X  may  be  represented  by  its  value  at  ,  together  vith  all 

O  0 

derivatives  of  f  at  x  .  Consequently,  if  x  is  to  be  a  relative  mini* 
mum,  all  sufficiently  close  nel^borin^;  x  must  not  yield  a  smaller 
y  =  f(x),  i.e. 

f  (  X  )  -  f  (  X  °  )  =  , 


(5) 


0^  Im  ‘ttwrma  at, 


,-4- 


o 


2  /  ®  '' 

/,x  1  d  f  .  0.2  >  ^ 

(6)  ^  ^  (x  -  X  )  «  0 

dx 

•luce  at  x^,  ^  a  o;  and  If  x  ±b  close  to  x°,  the  term 

shovn  in  (6)  vill  dominate  the  eoaibilied  effett  of  all  other  terms  In 
the  expansion  (4)  because  all  rvaalnlBg  tenui  involve  x  -  to  a 
higher  order.  That  (6)  la  eq^irmlMlt  to  (3)  ts  obvious. 

If  f  is  ncfv  a  function  of  tvo  Inda^nde&t  variables,  (l)  may  be 
rewritten  as 

(7)  y  =  »2> 

and  a  pair  of  neeesaary  eonditioaa  emn.Tpondin^  to  (2)  are 

(6)  ^  m  t  m  0  ,  ^  S  f  ^  ©  c. 


These  eonditiona  state  that  the  taa^snt  vectors  l©  the  surface 
defined  by  (?)  la  the  directions  of  Increasing  and  x^  must  be 

horixontal,  that  Is,  parallel  to  the  x^^x^  plane.  (Figure  2)  If 

'0 

f  Is  sufficiently  ^nooth.  Its  Taylor  expansion  around  any  root  x 
of  (8)  ip  given  by 


»%/  \  <%/0  0\  /■  ^  \ 
f{Xj,  X^)  =  Xj)  +  —  (x^-x^/  +  —  (Xj-X^) 


fgt 


1 

*  ? 


7?  Cxj,-Xi>  .  S  Jx,dx, 


a^f 


ax 


1 

a*f  0,2 


i"  2 


n 


) 


an  argument  similar  io.  use^  t9,  derl'^^  i  <raffleleil^  etal‘=^ 

dltion  .for  to  ba  a  (rvlAtiva)  niia.t!Tnim  Is 

2  ^  ♦  ||  ^  0 

tht  taiiflent  rectora  {dx^,  0)  -  (x^  -  x^,  O)  aad  (0,  dx^)  ■  (0,  -  x^^ 

to  tha  surface  (7)  at  determine  a  2-diJDenaloQal  tangent  plane 
^12  ^  ®12*  Since  x®  is  to  be  a  relative  mlninumg,  all  auffieiently 
close  ncis^boring  points  must  not  yield  a  smaller  points  on  the 
tangent  (dx^  ©),  (0^  dx^)  being  only  speeial  oaseso  More  generally, 
points  on  any  vector  dx  -  (dx, ,  dx^)  at  vtileh  is  a  linear  combina¬ 
tion  Oi  Idx^,  0)^  (O,  dx^)  must  also  not  yle^d  a  amailer  y.  Since 
(dx^j»  o),  (O,  dx^J  span  or  form  a  basis  of  dS^,  dx  may  be  represented 


[dx  ^  dx^)  =  cos  a 

1  <-  1  . 


#  3  ,  3X 


where  cos  cos  are  the  direction  cosines  of  dx  with  respect  to 
the  local  coordinate  system  on  with  origin  at  x°.  Ccnse<iuently^ 
a  strengthened  necessary  condition  for  a  relative  ai/iir.^  mt  x°,  which 
Includes  the  tvo  equations  in  (8)  as  special  cases, 


V  f 


3  f  „  .  3  f 

=  :  -  COB  O-  -r  r— 


-  ax 


where  (cos  a  cos  02)  ere  the  direction  cosines  of  an  arbitrary  tangent 
weetor  dx  in  x®.  V^f  is  called  the  directional  derivative  of 

f  in  the  direction  dx.  Also,  a  strengthened  sufficient  coedition  for 
%  relative  Tnii>iin.ui  at  x^  is  ,  by  taking  th@  direc derivative©  of 
T  i\  again  in  direeticn  dx, 

X  ^  Xp 


for  the  dl««en  «sbltraiy  tangeai  fli  In  afe  x^- 

TbA  taziftnt  plane  ttt  any  ^lat  1c  OB  lo  4ef the 

llnoor  terms  in  ths  txpanfliofi  (9)#  i-e.. 


(1^) 


y  -  X2) 


1  *  3*2 


X 


I 


vbsre  (x^^,  Xg,  y)  i»  s  point  in  ^  dtrl^tifUi  m% 

to  he  €7slusted  at  x.  To  put  the  Hotter  diTfeirentiy^  :Lf  5^^  itaelT 
it  already  a  plaae*  then  the  expansion  (9)  ot  any  point  on  it  sust 
he  exact  vlth  only  the  linear  tezas,  i.t. ,  aH  hi^tr-ordtr  teres 
BRiflt  vaniah  identically,  nie  nortnal  to  tht  taastat  plant  dS^  at  1^^ 
alto  called  the  gradient  vector  yf  to  8^^  at  1c,  hat  eospostots  pro^ 
portional  to 


f^c\ 

j 


ill  H 


-1) 


At  a  relative  Hiiaimim  point  x^  on  s^,  (8)  holds  and  thus  in  (ll|/ 
y  •  f(x^,  x^)  IdenticsUy,  vhieh  it  another  viy  of  saying  that  dS^^ 
at  x  is  parallel  to  the  x^  -  plant  (calltd  the  bass  plane}  i§od 
at  distance  f (x^,  x®)  fpon  It.  At  an  arbitfmzy  point  a?  on  th© 

left  side  of  (ih)  need  not  yaniahi  ao  viU  the  ri|^t  aide  not  also. 
But  %m  right  tide  of  ilk)  It  tha  tame  tt  y^f  defiiied  in  (laj  if  ve 
choose  a  point  (x^,  y)  in  at  x  such  that 


'COS 


Bine?  (eosa^ji  c<?s  a^,)  represents  a  unit  vector  vith  respect  to  the 


l^al  coorUaate  loratazn  in. 


f  is  pre- 


glseXr  the  cempooent  (l»e.  projfect-ioaj  of  vf  in  the  direction  dx. 

(12)  atetea,  therefore,  that  at  a  critical  point  on  the  pro- 
ieeticn  of  the  sradient  vector  ?f  In  every  dixectlco  dx  m 
and  (lU)  ahevs  that  at  a  noneritical  point  x  of  8^,  the  projection  of 
Vf  on  dSji^  seed  not  vanlih  for  all  directions  dx  in  dS^*  Thi#  te* 
iiult  applies  generally  to  opeeea  of  dlnensions  greater  than  2. 

On  the  basis  of  the  above  gecnetrle  coneepta,  it  is  now  poa* 
slble  to  give  an  intrinsic  eharacterixation  of  Lagrange  BiltipXiera* 
roiulder,  for  example,  a  constrained  problem  of  the  foUoving 

type:  MhniBlse  (7)  subject  to 


(17)  g(x^,  x^)  «  0  • 

(17)  defines  a  curve  in  the  base  plane,  and  fnininw  of  f  is  to  be 
aou^t  aaong  all  points  x  •  (x^,  x^)  lyln^  on  this  curvt  C.  At  aaQT 
such  (relative)  mlniaaai  point  x^^the  directional  derivative  Vf  of  t 
along  the  tangent  to  C  auat  vanish  by  (12),  vhere  cos  cos 

denote  the  cemponents  of  the  unit  tangent  dx  to  C  «t  X®.  However,  (rf| 

shows  that 

Vdx*  2  ^  ® 

A  te 

also  at  this  point.  Consequently,  ^  ^ be  collinear^ 

i.e.  ^  fat  SQBlf'  scarcer 
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(19)  ,  ^ 

dx  U  tixe  tao^gurt  W9ttot  to  C  defined  by  (17).  (19)  ie 

^qolvalMt  to 


(80) 


which  0X0  the  ueuol  eoadltloas  derlwable  ffca  dirfereotlation  of  tbt 
Lagranglan  function.  A  fufflcleat  eonditlooi  for  a  relative  mlnlmat 
at  is  (13)  vith  cosa  eosa^  beine  SigBiin  the  cooponeats  cf  dr 
(the  taagant  to  C  at  x^)  vith  respect  to  the  local  coox^dlxiate  sygtem 
at  r®. 

Generallsstlon  of  the  above  gectietrlc  characterization  of  Lagrange 
miltlpliers  to  spaces  of  higher  diaezxsions  Is  Isviediate.  Let 


(21)  y  *  f(x^^ 

again  denote  the  objective  fonction  to  he  extremlted,  and 

(22}  g  X  f  (j*l* 

J  i  2 

denote  a  set  of  Irkiependent  side  constraints.  Each  g^  defines  a 
liypersurface  8^  in  the  base  plane  (i.e.,  the  (x^  — ,  x^)*plane  in 
(n  *  l)-dlBensiaiial  space  with  the  last  axis  y).  The  intei?^ 
section 


(23) 


sf  these  taypersiirfaees  in  genaml  yields  an  (n  «  r)»djAeasloaal  sur¬ 
face  Is  the  base  plane  ,  ^  a  ieritieal  point.,  ®  on. 
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a  tangent  space  generally  exists  vlth  basis  vectors 

(dx)^,  — ,  and  the  directional  derivative  vf  of  f  along  each 

such  basis  vector  must  vanish-  This  says  that  yf  mustf  be  orthogonal  to 

to  _ orthogonal  ccmplement  Of 

'^12- ••r  chows  that 


r  .  .  dx  r  G 

lal  dX.  i 


(j  =  1* 


also  at  x^.  Hence,  if  dx  «  (dx*  dx^,  is  chosen  td  range  oYer 

1  ^  ^ 

the  basis  vectors  (dx)  ,  — ,  (d:-t5  ;f  merely  ehOVf 

that  each  ^g.  (j»l»  2,  •— ,  rj  -ilsc  crthogonal  to  dS._  *  3ut 
gi  ••-»  r)  are  independent,  /.  .  would  form  a  basis 

4f  -  • 

for  dS^  since 

(85)  SiaSS,  -  .  ,  dini  dS,'!;  =  n 


at  any  regular  point  oa  .  Therefore*  for  some  scalars  \- 

'  1, 

we  must  have 


(86) 

which  gives  in  ecppOnent  form. 


(£7) 


II 


These  form  a  system  of  n  equations  In  n  +  r  unknowns 

\  **•-,  ^  .  But  since  (x,,  — .  x  /  must  also  satisfy  (22),  r  ad- 

u ,  r  u  u 

ditional  equations  are  to  be  added-  Consequently^  the  Lagrange  multi¬ 
pliers  are  merely  coefficients  used  in  expressing  a  certain  linear 
dependence  relation  among  the  gradient  vectors  to  f  and  gj's* 


-11- 


The  sufficiency  condition  is  also  easily  generEilized.  With  re¬ 
spect  to  the  basis  vectors  (dx)^,  (dx)^"^  of  a  typical 

unit  tangent  vector  dx  in  "tlie  form 

(28)  dx  =  cos 


where  cos  a.,,  — cos  a  are  the  direction  cosines 
1  n-r 

of  dx  with  respect  to 

(dx)\  — ,  (dx)*^'^.  Ihen 


{39)  v"  f  =  7  (V  f)  .  ^  cos  aj 

dx  dx  dx  ^  r  j 


“j  >  0 


together  with  (27)  yields  a  relative  constrained  minimum  at  x^®  Al¬ 
ternatively,  if  z  «  (z,  — ,  z  )  is  any  vector  In  the  hase  plane,  a 

i.,  n 

relative  constrained  minimum  at  a  point  Is  assured  by  (27)  and 


»  ...  I  3F.  ) 

1  A 


/  ti  ^  *  A 

7  \ 

'A 

« 

i 

• 

\  '^V^i  ’  '  '  / 

'  i 

\'^^i 

*  ^ . 


for  all  z  orthogonal  to  'Hint  is,  for  all 

satisfying 


r«>n  ^  J 

1  c 


.©«,  'F ) 


(31) 


-  1.1 


'ttiat  (30)  and  (31)  i^ay  t>e  translated  into  appropriate  properties  of 
the  bordered  Hessian 


(32) 


.  .  0 

»6i 

0  .  . 

•  •  •  •  ^ 

0  .  . 

.  .  0 

•  •  •  •  ax^ 

dx*^ 

. 

. 

. 

air 

*  ’  ^ 

ax_^3x. 

dx^ 

n 


may  also  be  readily  established. 


